Abstract — Feedback communication is studied from a control-
theoretic perspective, mapping the communication problem to a
control problem in which the control signal is received through 
the same noisy channel as in the communication problem, 
and the (nonlinear and time-varying) dynamics of the system 
determine a subclass of encoders available at the transmitter. 
The MMSE capacity is defined to be the supremum exponential 
decay rate of the mean square decoding error. This is upper 
bounded by the information-theoretic feedback capacity, which 
is the supremum of the achievable rates. A sufficient condition 
is provided under which the upper bound holds with equality. 
For the special class of stationary Gaussian channels, a simple 
application of Bode's integral formula shows that the feedback 
capacity, recently characterized by Kim, is equal to the maximum 
instability that can be tolerated by the controller under a given 
power constraint. Finally, the control mapping is generalized to 
the N-sender AWGN multiple access channel. It is shown that
Kramer's code for this channel, which is known to be sum rate 
optimal in the class of generalized linear feedback codes, can be 
obtained by solving a linear quadratic Gaussian control problem. 



I. Introduction 

Feedback loops are central to many engineering systems.
Their study naturally falls at the intersection between
communication and control theories. However, the information-theoretic
approach and the control-theoretic one have often
evolved in isolation, separated by almost philosophical differences.
In this paper we attempt a step toward bridging the gap,
showing how tools from both disciplines can be applied to
study fundamental limits of feedback systems and to design
efficient codes for communication in the presence of feedback.

Consider the feedback communication problem over an
arbitrary point-to-point channel depicted in Fig. 1a. The encoder,
which has access to the channel outputs causally and
noiselessly, wishes to communicate a continuous message
$M \in (0, 1)$ to the decoder through $n$ channel uses. At the
end of the transmissions, the decoder forms an estimate $\hat{M}$
based on the received channel outputs, and the mean square
error (MSE) of the estimate $\hat{M}$ represents the performance
metric of the communication.

We map this communication problem to the general (nonlinear
and time-varying) controlled dynamical system depicted
in Fig. 1b, in which the initial state of the system corresponds
to the message $M$, and the control actions, received through
the same noisy channel as in the communication problem,
correspond to the signals transmitted by the encoder. In
this representation, the set of controllers for a given system
corresponds to a subclass of encoders for the communication
problem. In fact, the system can be viewed as a filter which
determines the information pattern [1] on which the transmitted
signals (actions) of the encoder (controller) can depend.
A similar mapping for the special case of linear time-invariant
(LTI) systems and controllers was first presented in [2].

We study three different channel models. First, we con- 
sider a general point-to-point channel. The MSE exponent 
is defined as the exponential decay rate of the MSE in 
block length n, and the (feedback) minimum mean square 
error (MMSE) capacity is defined as the supremum of all 
achievable MSE exponents with feedback. We show that 
the MMSE capacity is upper bounded by the information- 
theoretic (feedback) capacity, the supremum of all achievable 
rates with feedback. We also present a sufficient condition, 
under which the (information-theoretic) capacity coincides 
with the MMSE capacity. These results provide a step towards 
the understanding of the connection between estimation and 
information theory. 

Second, we focus on the stationary Gaussian channel with
feedback, the capacity of which was recently characterized by
Kim [3]. Applying the discrete extension of Bode's result [4]
(cf. [5], [6]), we observe that the capacity of the Gaussian
channel under power constraint $P$ is equal to the maximum
instability which can be tolerated by a linear controller with
power at most $P$, acting over the same stationary Gaussian
channel. This follows almost immediately from the previous
results [2], [3] and provides a step towards the understanding
of the connection between stabilizability over some noisy
channel and the capacity of that channel.

Finally, we consider the N-sender additive white Gaussian
noise (AWGN) multiple access channel (MAC) with feedback
depicted in Fig. 2a. We show that the linear code proposed by
Kramer [7], which is known to be optimal among the class of
generalized linear feedback codes [8], can be obtained as the
optimal solution of a linear quadratic Gaussian (LQG) problem
given by a linear time-invariant (LTI) system controlled over
a point-to-point AWGN channel where the asymptotic cost is
the average power of the controller. These results provide a
step towards the understanding of how control tools can be
used to design codes for communication.

We now wish to place our results in the context of the
related literature. The results on the N-sender AWGN-MAC
generalize previous ones of Elia [2], who recovers Ozarow's
code [9], a special case of Kramer's code, using control theory.
Our approach is different from Elia's in both the model and the
analysis. Our reduction is to a control problem over a single
point-to-point channel for any number of senders $N \ge 2$,
and our analysis is based on the theory of LQG control. In
contrast, in [2] the communication is limited to the 2-sender MAC,
which is mapped to a control problem also over a 2-sender
MAC, and the analysis is based on the technique of Youla
parameterization.

The connection between the MMSE and capacity has been 
investigated extensively and from different perspectives in the 
literature. For example, in a classic paper Duncan [10] expresses
the mutual information between a continuous random
process and its noisy version corrupted by white noise, in terms
of the causal MMSE. More recently, Forney [11] explains the
role of the MMSE in the context of capacity achieving lattice
codes over AWGN channels. Guo et al. [12] and Zakai [13]
showed that for a discrete random vector observed through 
an AWGN channel, the derivative of the mutual information 
between input and output sequences with respect to the signal- 
to-noise ratio (SNR), is half the (noncausal) MMSE. We point 
out that these authors study the average MMSE of a vector 
observed over a noisy channel without feedback as a function 
of the SNR. In contrast, we consider the estimation of a 
single random variable (message), given the observation of 
a whole block of length n, and we look at the exponential 
decay rate of the MMSE with n, at fixed SNR. Of more 
relevance to us is the recent work of Liu and Elia [14], who
study linear codes over Gaussian channels obtained using a 
Kalman filter (KF) approach. For this class of codes, they 
show that the decay rate of the MMSE equals the mutual 
information between the message and the output sequence. 
In contrast, our results for the MMSE capacity are derived 
based on an information-theoretic approach and hold for all 
codes over general channels. 

Additional works in the literature revealed connections
between control theory and information theory. Without
attempting to be exhaustive, we distinguish between those
that use information theory to study control systems and
those that use control theory to study communication systems.
Within the first group, Mitter and Newton [15], [16] studied
estimation and filtering in terms of information and entropy
flows. In the last decade, Bode-like fundamental limitations in
controlled systems have been analyzed with success from an
information-theoretic perspective [17], [18], [19], [20], [21].
In this context, we point out that our Lemma 3, when it
is specialized to the case of additive channels, provides an
alternative proof of Theorem 4.2 in [20].

Within the second group, Elia [2] was the first to map
linear codes for additive white Gaussian noise channels to an
LTI system controlled over an AWGN channel. Subsequently,
Wu et al. [22] studied the Gaussian interference channel in
terms of estimation and control. Tatikonda and Mitter [23]
used a Markov decision problem (MDP) formulation to study
the capacity of Markov channels with feedback, and recently
Coleman [24] considered the design of the feedback encoder
from a stochastic control perspective. Finally, we also refer
the reader to the work in [25], which gives a historical
perspective and contains selected additional references.

The rest of the paper is organized as follows. Section II
presents the definitions and the mapping between the feedback
communication and the control problem. Section III provides
the upper bound on the MMSE capacity for a general
point-to-point channel. The point-to-point stationary Gaussian channels
and the AWGN multiple access channel are considered in
Section IV and Section V, respectively. Finally, Section VI
concludes the paper.

Notation: A random variable is denoted by an upper case
letter (e.g. $X, Y, Z$) and its realization is denoted by a lower
case letter (e.g. $x, y, z$). Similarly, a random column vector
and its realization are denoted by bold face symbols (e.g.
$\mathbf{X}$ and $\mathbf{x}$, respectively). Uppercase letters (e.g. $A, B, C$) also
denote matrices, which can be distinguished from random
variables based on the context. The $(i,j)$-th element of $A$
is denoted by $A_{ij}$, and $A^T$ and $A'$ denote the
transpose and conjugate transpose of the matrix $A$, respectively.
We use the following short notation for covariance matrices:
$K_{\mathbf{X}\mathbf{Y}} := E(\mathbf{X}\mathbf{Y}') - E(\mathbf{X}) E(\mathbf{Y}')$ and $K_{\mathbf{X}} := K_{\mathbf{X}\mathbf{X}}$.

II. Definitions and Control Approach 

Consider the communication problem depicted in Fig. 1a,
where the sender wishes to convey a message $M \in \mathcal{M} := (0, 1)$
to the receiver through $n$ uses of a stochastic channel,

$$Y_i = h_i(X_i, Z_i), \quad i = 1, \ldots, n,$$

where $X_i \in \mathcal{X}$ and $Y_i \in \mathcal{Y}$ denote the input and output of the
channel, respectively, and $Z_i \in \mathcal{Z}$ denotes the noise at time $i$.
The set of mappings

$$h_i : \mathcal{X} \times \mathcal{Z} \to \mathcal{Y}, \quad i = 1, \ldots, n$$

and the distribution of the noise sequence $\{Z_i\}_{i=1}^{n}$ determine
the channel. The noise process $\{Z_i\}$ is assumed to be independent
of the message $M$. We assume that the output symbols are
causally fed back to the sender and the transmitted symbol $X_i$
at time $i$ can depend on both the previous channel output
sequence $\{Y_1, Y_2, \ldots, Y_{i-1}\}$ and the message $M$.

Definition 1: We define a (feedback) n-code as

1) an encoder: a set of (stochastic)^1 encoding maps
$f_i : \mathcal{M} \times \mathcal{Y}^{i-1} \to \mathcal{X}$, also known to the receiver, such that
for each $i = 1, \ldots, n$

$$X_i = f_i(M, Y^{i-1}) \tag{1}$$

and

2) a decoder: a decoding map $\phi : \mathcal{Y}^n \to \mathcal{M}$ which
determines the estimate of the message $M$ based on the
received sequence $Y^n$, i.e.,

$$\hat{M} = \phi(Y^n). \tag{2}$$

We assume that the message $M \in (0, 1)$ is a random variable
uniformly distributed over the unit interval and does not
depend on $n$. As the performance measure of the communication,
we consider the MSE,

$$D^{(n)} := E\big((M - \phi(Y^n))^2\big), \tag{3}$$

where the expectation is with respect to the randomness of both
the message and the channel. Note that the decoder does not
affect the joint distribution of $(M, X^n, Y^n, Z^n)$ and simply
estimates the message at the end of the block. Hence, without
loss of generality, we pick the optimal decoder, namely, the
MMSE estimator of the message given $Y^n$, and we call an
encoder optimal if it minimizes $D^{(n)}$. Let

$$-\frac{1}{2n} \log\big(D^{(n)}\big)$$

be the exponential decay rate of the MSE with respect to $n$.

^1 For stochastic encoders we can write $X_i$ as a function of $(M, Y^{i-1}, V_i)$,
where $\{V_i\}$ is a random process independent of $M$ and $\{Z_i\}$.

Fig. 1. An arbitrary encoder for the feedback communication over a stochastic channel depicted in (a) can be represented as a dynamical system controlled
over the same channel depicted in (b).

Definition 2: The MSE exponent $E$ is called achievable
(with feedback) if there exists a sequence of n-codes such
that

$$E \le \liminf_{n \to \infty} -\frac{1}{2n} \log\big(D^{(n)}\big). \tag{4}$$

The MMSE capacity $C_{MMSE}$ is the supremum of all achievable
MSE exponents.

The communication problem described above can be viewed 
as a control problem where the encoder wishes to control 
the knowledge of the receiver about the message. In his
notable paper [26], Witsenhausen wrote: "When communication
problems are considered as control problems (which
they are), the information pattern is never classical since 
at least two stations, not having access to the same data, 
are always involved". However, this is not the case for the 
feedback communication problem in Fig. 1a, since the encoder
(controller) has access also to the information available at 
the decoder via feedback. In fact, below we show that this 
feedback communication problem can be represented as a 
control problem in which the controller has the complete state 
information. 

Consider the control problem in Fig. 1b, where the state at
time $i$ is

$$S_i = g_i(S_{i-1}, Y_{i-1}), \quad i = 1, \ldots, n \tag{5}$$

with initial state

$$S_0 = M$$

and $Y_0 = 0$. We refer to the mappings

$$g_i : \mathcal{S}_{i-1} \times \mathcal{Y} \to \mathcal{S}_i, \quad i = 1, \ldots, n$$

as the system. The controller, which observes the current
state $S_i$, picks an action (symbol) $X_i \in \mathcal{X}$,

$$X_i = \pi_i(S_i), \quad i = 1, \ldots, n \tag{6}$$

according to a set of (stochastic) mappings

$$\pi_i : \mathcal{S}_i \to \mathcal{X}, \quad i = 1, \ldots, n.$$

We refer to the set $\{\pi_i\}_{i=1}^{n}$ as the controller.

The communication problem in Fig. 1a can be represented
as the control problem in Fig. 1b as follows. Let the system
$\{g_i\}_{i=1}^{n}$ be such that the state $S_i$ at time $i$ is the collection
of the initial state $S_0 = M$ and the past observations $Y^{i-1}$,
namely,

$$S_i = (M, Y^{i-1}), \quad i = 1, \ldots, n. \tag{7}$$

Also, let the controller $\{\pi_i\}_{i=1}^{n}$ be picked according to the
encoder in the communication problem such that

$$\pi_i(S_i) = f_i(M, Y^{i-1}), \quad i = 1, \ldots, n.$$

Then the joint distribution of all the random variables
$(M, X^n, Y^n, Z^n)$ in the control problem is the same as that
in the communication problem.

To complete the representation, let $\hat{S}_0(Y^n)$ be the MMSE
estimate of the initial state $S_0$ based on $Y^n$, and the final cost
be

$$c_n(S_n) := \big(S_0 - \hat{S}_0(Y^n)\big)^2.$$

We call a controller optimal if it minimizes the final expected
cost

$$E(c_n(S_n)) = E\big((M - \hat{M}(Y^n))^2\big).$$

Thus, the optimal controller represents the optimal encoder for
the communication problem.
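The mapping above can be illustrated with a small simulation. The sketch below is only an illustration under stated assumptions: the additive channel, the particular encoder rule `encoder`, and all function names are hypothetical and are not taken from the paper. It checks that a controller acting on the state $S_i = (M, Y^{i-1})$ produces exactly the same inputs and outputs as the feedback encoder when both see the same noise realization.

```python
import random

def channel(x, z):
    # Illustrative additive channel (an assumption): Y_i = X_i + Z_i.
    return x + z

def encoder(i, m, past_outputs):
    # Hypothetical causal encoder f_i(M, Y^{i-1}); any causal rule would do.
    if not past_outputs:
        return m
    return m - sum(past_outputs) / len(past_outputs)

def run_communication(m, noise):
    # Feedback communication: X_i = f_i(M, Y^{i-1}).
    inputs, outputs = [], []
    for i, z in enumerate(noise, start=1):
        x = encoder(i, m, outputs)
        inputs.append(x)
        outputs.append(channel(x, z))
    return inputs, outputs

def system(i, state, y_prev):
    # g_i: keep M and append the latest output, so S_i = (M, Y^{i-1}).
    # At i = 1 the dummy value Y_0 = 0 is ignored.
    m, ys = state
    return (m, ys) if i == 1 else (m, ys + (y_prev,))

def controller(i, state):
    # pi_i(S_i) chosen to coincide with f_i(M, Y^{i-1}).
    m, ys = state
    return encoder(i, m, list(ys))

def run_control(m, noise):
    # Control loop: S_i = g_i(S_{i-1}, Y_{i-1}), X_i = pi_i(S_i).
    state, y_prev = (m, ()), 0.0  # S_0 carries M, and Y_0 = 0
    inputs, outputs = [], []
    for i, z in enumerate(noise, start=1):
        state = system(i, state, y_prev)
        x = controller(i, state)
        y_prev = channel(x, z)
        inputs.append(x)
        outputs.append(y_prev)
    return inputs, outputs
```

Running both loops on one noise sequence and comparing the returned lists gives a quick sanity check of the equivalence for this particular encoder; it does not, of course, replace the distributional argument in the text.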

The system in (7) is the most general system which can
represent all the encoders for the communication system.
However, if the system $\{g_i\}_{i=1}^{n}$ is more restricted such that
the state $S_i$ is a filtered version of $(S_0, Y^{i-1})$, then the
controller $\{\pi_i\}_{i=1}^{n}$ represents only a subclass of encoders $\{f_i\}_{i=1}^{n}$
where the transmitted symbol $X_i$ depends on $(M, Y^{i-1})$ only
through $S_i$ (see (6)). In that case, we can view the system
as a filter which determines the information pattern available
at the controller (encoder). Below, we show a subclass of
encoders for which the state $S_i$ does not include all the past
outputs as in (7), yet it contains all the optimal encoders.
Let $F_{M|Y^{i-1}}(\cdot \,|\, Y^{i-1})$ be the conditional distribution of the
message $M$ given the channel outputs $Y^{i-1}$.

Lemma 1: An optimal encoder which minimizes the MSE
$D^{(n)}$ can be found in the subclass of encoders which is
determined by the system with state of the form
$S_i = \big(M, F_{M|Y^{i-1}}(\cdot \,|\, Y^{i-1})\big)$.

Proof: See Appendix A.

This lemma, which is based on a well-known result in
stochastic control, provides a sufficient information pattern
upon which the optimal feedback encoders for communication
over a general channel can be built. For example,
in the special case of memoryless channels, Shayevitz and
Feder [27] proposed an explicit encoder which uses the
information pattern described in Lemma 1 and showed that it
is optimal in terms of the achievable rates.

III. MMSE Capacity 

In this section we present the relationship between the
MMSE capacity and the information-theoretic capacity [28].

First, we review the definition of the information-theoretic
achievable rates, which is an asymptotic measure based on a
sequence of message sets whose size depends on the block
length $n$. Consider the n-code in Definition 1 and let the
message $M_n \sim \mathrm{Unif}(\mathcal{M}_n)$ be uniformly distributed over the
set $\mathcal{M}_n := \{1, 2, \ldots, 2^{nR}\}$ such that

$$X_i = f_i(M_n, Y^{i-1}), \quad i = 1, \ldots, n. \tag{8}$$

Let the probability of error be $P_e^{(n)} = P(\hat{M}_n \ne M_n)$. A rate $R$
is called achievable (with feedback) if there exists a sequence
of n-codes with message sets $\mathcal{M}_n$ for which $P_e^{(n)} \to 0$ as
$n \to \infty$. The (feedback) capacity $C$ is the supremum of all
achievable rates.

Recall that the MSE exponent $E$ is defined based on a
message $M \in (0, 1)$ which does not depend on the block length $n$.
The following lemma provides the connection between the
achievable rates and MSE exponents.

Lemma 2: If the MSE exponent $E$ is achievable, then any
rate $R < E$ is achievable. Conversely, if the rate $R$ is achievable
and the probability of error satisfies $\lim_{n \to \infty} 2^{2nR} P_e^{(n)} = 0$,
then the MSE exponent $E = R$ is achievable.
Proof: See Appendix B.
According to Lemma 2, showing that an MSE exponent
$E > R$ is achievable for a uniform message $M \in (0, 1)$ is
sufficient to show that rate $R$ is achievable.
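The link between rates and MSE exponents can be made concrete with a quantization argument. The sketch below is an illustration only, not the proof in Appendix B: it assumes that the message in $(0, 1)$ is mapped to one of $2^{nR}$ equal bins, that a correct decoding returns the bin midpoint, that a decoding error costs at most a squared error of 1, and that logarithms are base 2. The names `mse_upper_bound` and `mse_exponent` are hypothetical.

```python
import math

def mse_upper_bound(n, rate, p_err):
    # Correct bin: the error is uniform over a bin of width 2^{-nR},
    # which gives a squared error of width^2 / 12 on average.
    # Wrong bin: the squared error is at most 1, weighted by p_err.
    width = 2.0 ** (-n * rate)
    return width ** 2 / 12.0 + p_err

def mse_exponent(n, mse):
    # Exponential decay rate -(1/(2n)) log2(D^(n)); base 2 is an assumption.
    return -math.log2(mse) / (2 * n)

if __name__ == "__main__":
    n, rate = 200, 0.5
    fast = mse_upper_bound(n, rate, 2.0 ** (-3 * n * rate))  # P_e well below 2^{-2nR}
    slow = mse_upper_bound(n, rate, 2.0 ** (-n * rate))      # P_e well above 2^{-2nR}
    print(mse_exponent(n, fast))  # close to R
    print(mse_exponent(n, slow))  # the bound only guarantees about R / 2
```

Under these assumptions the bound yields an exponent close to $R$ only when the error probability decays faster than $2^{-2nR}$, which is the role played by the error-probability condition in Lemma 2.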



Below, we provide a general upper bound on the MMSE
capacity. Before proceeding, we need some information-theoretic
definitions [28]. Let $h(X)$ denote the differential
entropy of a random variable $X$ and $I(X;Y)$ be the mutual
information between random variables $X$ and $Y$. Moreover,
let the minimum entropy rate of a general random process $\{X_i\}$
be defined as

$$h(X) := \liminf_{n \to \infty} \frac{h(X^n)}{n}.$$

For a stationary random process, the limit exists and the
entropy rate is defined as

$$h(X) := \lim_{n \to \infty} \frac{h(X^n)}{n}.$$

We call a channel invertible if $Z_i$ can be recovered from
$(X_i, Y_i)$, namely, for every time $i \ge 1$ there exists a function
$h_i^{-1}$ such that

$$Z_i = h_i^{-1}(X_i, Y_i).$$



Lemma 3: For a system controlled over an invertible channel,
we have

$$I(S_0; Y^n) \le h(Y^n) - h(Z^n)$$

where equality holds if and only if $X_i$ is a deterministic
function of $(S_0, Y^{i-1})$ for $i = 1, \ldots, n$.

Proof: See Appendix C.

Remark 1: Theorem 4.2 in [20] can be recovered from
Lemma 3 by considering the special case of additive channels
$Y_i = X_i + Z_i$, which are invertible.

Theorem 1: For an invertible channel, the MMSE capacity
$C_{MMSE}$ is upper bounded as

$$C_{MMSE} \le C \le \sup \; h(Y) - h(Z),$$

where the supremum is over all sequences of n-codes.

Remark 2: Theorem 1 holds without any assumption on the
noise sequence or the controlled system.
Proof: The first inequality

$$C_{MMSE} \le C \tag{9}$$

can be shown using Lemma 2. Suppose (9) does not hold and
$C < C_{MMSE}$. Then, we can always find an achievable MSE
exponent $E^*$ such that $C < E^*$, and subsequently we can pick
a rate $R^*$ such that $C < R^* < E^*$. Hence, by Lemma 2 the
rate $R^*$ is achievable, and this contradicts the definition of $C$.
Therefore, the inequality (9) must hold.

For the second inequality, from Fano's inequality we have

$$R \le \frac{1}{n} I(S_0; Y^n) + \epsilon_n,$$

where $\epsilon_n \to 0$ as $n \to \infty$. Hence, for a given sequence of
n-codes

$$R \le \liminf_{n \to \infty} \frac{1}{n} I(S_0; Y^n). \tag{10}$$

From Lemma 3 we have $h(Z^n) + I(S_0; Y^n) \le h(Y^n)$. Taking
the limit and rearranging terms we get

$$\liminf_{n \to \infty} \frac{1}{n} I(S_0; Y^n) \le h(Y) - h(Z).$$

Combining with (10) we have $R \le h(Y) - h(Z)$, and taking
the supremum over all sequences of n-codes completes the
proof. ■

Theorem 1 shows that in general, the (feedback) MMSE capacity
is upper bounded by the (feedback) capacity. Below, we
provide a sufficient condition on the channel under which this
bound is tight.

Corollary 1: For a given channel, we have

$$C_{MMSE} = C$$

if any rate $R < C$ can be achieved such that

$$\lim_{n \to \infty} 2^{2nR} P_e^{(n)} = 0. \tag{11}$$

Proof: By assumption, for any rate $R < C$ the condition
on the decay rate of the probability of error $P_e^{(n)}$ in Lemma 2
is satisfied, and hence any MSE exponent $E = R < C$ is
achievable. Therefore, we conclude that $C_{MMSE} \ge C$. On
the other hand, by Theorem 1 we have $C_{MMSE} \le C$, and
hence $C_{MMSE} = C$. ■
In the next section we show that this equality holds for
some special cases of Gaussian channels including the AWGN
channel.

IV. Point-to-Point Gaussian Channels

In this section, we turn our attention to additive Gaussian
channels,

$$Y_i = X_i + Z_i$$

where the noise sequence $\{Z_i\}$ is a (colored) stationary
Gaussian process with power spectral density $S_Z(e^{j\omega})$. The
transmitted symbols are assumed to satisfy the (block) power
constraint $P$, i.e.,

$$\sum_{i=1}^{n} E(X_i^2) \le nP.$$

We define the subclass of LTI encoders to be the encoders
which can be represented by an LTI system and a linear
controller in Fig. 1b. In this case, the system and the controller
combined can be represented as a causal filter

$$F(z) = \frac{X(z)}{Y(z)}$$

where $X(z)$ and $Y(z)$ are the Z-transforms of the input and the
output sequence, respectively. We alternatively refer to $F(z)$
as the open loop transfer function. Let the instability $I$ be

$$I = \sum_{i=1}^{m} \log(|\beta_i|),$$

where $\beta_i$ are the $m$ unstable poles of the open loop transfer
function $F(z)$.

Theorem 2: Under power constraint $P$, the capacity of the
stationary Gaussian channel is

$$C = \sup I$$

where the supremum is over all causal filters $F(z)$ which
satisfy the power constraint

$$\frac{1}{2\pi} \int_{-\pi}^{\pi} \left| \frac{F(e^{j\omega})}{1 - F(e^{j\omega})} \right|^2 S_Z(e^{j\omega}) \, d\omega \le P. \tag{12}$$



Remark 3: Theorem 2 links the two works [2], [3] and is a simple
consequence of them. The Gaussian channel capacity can be
represented as the limit of a sequence of optimization problems
[29]. Kim [3] proved that the limit of this sequence can
be replaced by a single optimization problem, maximizing the
entropy rate of the output process over all stationary Gaussian
input processes which are causal and linear filtered versions
of the noise process. On the other hand, Elia [2] expressed
the asymptotic average directed information between the input
and output process for a stationary linear scheme over a
Gaussian channel in terms of the Bode integral. We connect
the capacity, the supremum achievable rate of communication,
to the supremum tolerable instability over Gaussian channels.

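Proof: The capacity of the stationary Gaussian channel
under power constraint $P$ is given by [3, Theorem 4.6]

$$C = \max_{B} \frac{1}{2\pi} \int_{-\pi}^{\pi} \frac{1}{2} \log \left| 1 + B(e^{j\omega}) \right|^2 d\omega \tag{13}$$

where the maximization is over all causal filters $B(e^{j\omega})$ which
satisfy

$$\frac{1}{2\pi} \int_{-\pi}^{\pi} |B(e^{j\omega})|^2 S_Z(e^{j\omega}) \, d\omega \le P. \tag{14}$$

This characterization can also be viewed as follows [3],

$$C = \sup \; h(Y) - h(Z) \tag{15}$$

where the supremum is over all stationary Gaussian processes
$\{X_i\}$ of the form $X_i = \sum_{k=1}^{\infty} b_k Z_{i-k}$ such that $E(X_i^2) \le P$.

We show that the capacity expression (15) can be rewritten
in terms of the instability which can be tolerated under the
power budget. Let the sensitivity function [30] be the transfer
function from the noise to the output,

$$S(z) = \frac{Y(z)}{Z(z)}. \tag{16}$$

By the discrete version of Bode's integral [4] we know that if the
closed loop is stable then

$$\frac{1}{2\pi} \int_{-\pi}^{\pi} \log(|S(e^{j\omega})|) \, d\omega = \sum_{i=1}^{m} \log(|\beta_i|) = I. \tag{17}$$

Let $S_Y(e^{j\omega})$ and $S_Z(e^{j\omega})$ be the power spectral densities of
the output and the noise sequence, respectively. From (16) we
have

$$|S(e^{j\omega})|^2 = \frac{S_Y(e^{j\omega})}{S_Z(e^{j\omega})} \tag{18}$$

and plugging (18) into (17) we get

$$I = \frac{1}{2\pi} \int_{-\pi}^{\pi} \frac{1}{2} \log \left( \frac{S_Y(e^{j\omega})}{S_Z(e^{j\omega})} \right) d\omega. \tag{19}$$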
(19) 
The entropy rate of a stationary Gaussian process $\{Y_i\}$
was shown by Kolmogorov [28] to be

$$h(Y) = \frac{1}{2\pi} \int_{-\pi}^{\pi} \frac{1}{2} \log\big(2\pi e \, S_Y(e^{j\omega})\big) \, d\omega. \tag{20}$$

Plugging (20) into (15) and comparing with (19) we have

$$C = \sup I$$

where the supremum is over all causal $B(z)$ such that (14) is
satisfied. On the other hand, $B(z) = X(z)/Z(z)$ can be written in
terms of $F(z) = X(z)/Y(z)$ as follows,

$$B(z) = \frac{F(z)}{1 - F(z)}.$$

Note that $F(z)$ is causal if and only if $B(z)$ is causal.
Therefore, considering constraint (14),
the supremum can be taken over all causal filters $F(z)$ such
that

$$\frac{1}{2\pi} \int_{-\pi}^{\pi} \left| \frac{F(e^{j\omega})}{1 - F(e^{j\omega})} \right|^2 S_Z(e^{j\omega}) \, d\omega \le P.$$

Theorem 2 states that the capacity of the stationary Gaussian
channel represents the maximum instability of an LTI system
which can be stabilized over the Gaussian channel by a linear
controller with power at most $P$.
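Bode's integral formula used in the proof can be checked numerically. The sketch below is a minimal illustration under stated assumptions: the sensitivity function is taken to be rational and normalized as $S(z) = \prod_i (1 - q_i z^{-1}) / \prod_k (1 - p_k z^{-1})$, where the zeros $q_i$ of $S$ are the open-loop poles and the poles $p_k$ of $S$ are the closed-loop poles, all strictly inside the unit circle. The function name `bode_integral` and the natural logarithm are choices made here, not notation from the paper.

```python
import numpy as np

def bode_integral(zeros, poles, num_points=1 << 16):
    # Uniform midpoint grid on (-pi, pi); the mean over the grid
    # approximates (1 / (2 pi)) times the integral over one period.
    omega = 2 * np.pi * (np.arange(num_points) + 0.5) / num_points - np.pi
    z_inv = np.exp(-1j * omega)
    s = np.ones_like(z_inv)
    for q in zeros:
        s *= 1 - q * z_inv      # open-loop poles appear as zeros of S
    for p in poles:
        s /= 1 - p * z_inv      # closed-loop poles (assumed stable)
    return np.mean(np.log(np.abs(s)))

if __name__ == "__main__":
    # Two unstable open-loop poles (1.5 and -2.0) and one stable one (0.3).
    print(bode_integral([1.5, -2.0, 0.3], [0.5, -0.2]), np.log(1.5) + np.log(2.0))
    # One pole at sqrt(1 + P), as in the AWGN case treated below.
    P = 3.0
    beta = np.sqrt(1 + P)
    print(bode_integral([beta], [1 / beta]), 0.5 * np.log(1 + P))
```

Only the open-loop poles outside the unit circle contribute to the integral, which is the instability $I$ of the text; stable open-loop poles and the closed-loop poles contribute zero.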

Next, we consider some special cases of the noise spectrum. 



A. ARMA(1) Noise

Let the noise process be first-order autoregressive moving-average
ARMA(1), i.e.,

$$S_Z(e^{j\omega}) = \left| \frac{1 + \alpha e^{j\omega}}{1 + \beta e^{j\omega}} \right|^2 \tag{21}$$

where $\alpha \in [-1, 1]$ and $\beta \in (-1, 1)$.

Corollary 2: For the stationary Gaussian channel with noise
process ARMA(1) given in (21), and power constraint $P$, we
have

$$C_{MMSE} = C = \sup I \tag{22}$$

where the supremum is over all causal $B(z)$ such that the
power constraint (14) is satisfied.

Proof: To show the first equality in (22), i.e., $C_{MMSE} = C$,
note that the feedback capacity of the Gaussian channel
with ARMA(1) noise is achieved [3, Theorem 5.3] by the
Schalkwijk-Kailath scheme with probability of error going
to zero doubly exponentially fast. Therefore, the sufficient
condition (11) provided in Section III is satisfied and hence
$C_{MMSE} = C$. The second equality in (22) follows from
Theorem 2. ■
Remark 4: Note that the AWGN channel, for which the
noise spectrum is white, is a special case of ARMA(1) with
$\alpha = \beta = 0$.



B. White Noise 

The following corollary considers the special case of 
AWGN channels. 

Corollary 3: For AWGN channels where the noise is i.i.d.
and Gaussian, $Z_i \sim N(0, 1)$, the transfer function

$$F(z) = -\frac{(\beta^2 - 1)\,\beta^{-1} z^{-1}}{1 - \beta z^{-1}}$$

which has one pole at $\beta = \sqrt{1 + P} > 1$, achieves the MMSE
capacity

$$C_{MMSE}(P) = C(P) = \frac{1}{2} \log(1 + P)$$

under power constraint $P$.

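Proof: Note that for the AWGN channel, $S_Z(e^{j\omega}) = 1$,
and by Corollary 2 we have

$$C_{MMSE}(P) = C(P) = \frac{1}{2} \log(1 + P).$$

Moreover, for $\beta = \sqrt{1 + P}$, the transfer function $F(z)$ has
instability $I = \log(\beta)$, hence

$$C_{MMSE}(P) = C(P) = I.$$

It is left to show that $F(z)$ also satisfies the power
constraint (12), i.e.,

$$\frac{1}{2\pi} \int_{-\pi}^{\pi} |B(e^{j\omega})|^2 S_Z(e^{j\omega}) \, d\omega \le P \tag{23}$$

where

$$B(z) = \frac{F(z)}{1 - F(z)} = -\frac{(\beta^2 - 1)\,\beta^{-1} z^{-1}}{1 - \beta^{-1} z^{-1}}.$$

Note that $B(z)$ is the same as the optimal filter given in [3, Lemma
4.1] for the AWGN channel, and similarly for $\beta > 1$ we have

$$\frac{1}{2\pi} \int_{-\pi}^{\pi} \left| \frac{1}{1 - \beta^{-1} e^{-j\omega}} \right|^2 d\omega
= \frac{1}{2\pi} \int_{-\pi}^{\pi} \left| \sum_{k=0}^{\infty} \beta^{-k} e^{-jk\omega} \right|^2 d\omega
= \sum_{k=0}^{\infty} \beta^{-2k}
= \frac{1}{1 - \beta^{-2}}.$$

Hence,

$$\frac{1}{2\pi} \int_{-\pi}^{\pi} \left| \frac{F(e^{j\omega})}{1 - F(e^{j\omega})} \right|^2 d\omega
= \frac{(\beta^2 - 1)^2 \beta^{-2}}{1 - \beta^{-2}} = \beta^2 - 1 = P \tag{24}$$

and $F(z)$ satisfies the power constraint. ■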
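The power computation in the proof can be verified numerically. The sketch below assumes the one-pole filter $F(z) = -\big((\beta^2 - 1)/\beta\big) z^{-1} / (1 - \beta z^{-1})$ with $\beta = \sqrt{1 + P}$, which is the form implied by the time-domain recursion of the Schalkwijk and Kailath scheme, and white noise with $S_Z = 1$. The function name `awgn_filter_power` is hypothetical.

```python
import numpy as np

def awgn_filter_power(P, num_points=1 << 16):
    # Evaluate (1 / (2 pi)) * integral of |F / (1 - F)|^2 over the unit
    # circle, using a uniform midpoint grid and S_Z = 1 (white noise).
    beta = np.sqrt(1.0 + P)
    omega = 2 * np.pi * (np.arange(num_points) + 0.5) / num_points - np.pi
    z_inv = np.exp(-1j * omega)
    F = -((beta ** 2 - 1) / beta) * z_inv / (1 - beta * z_inv)
    B = F / (1 - F)  # transfer function from the noise to the channel input
    return np.mean(np.abs(B) ** 2)

if __name__ == "__main__":
    for P in (0.5, 1.0, 10.0):
        print(P, awgn_filter_power(P))  # the two values should agree closely
```

Under these assumptions the filter uses exactly the available power, so the constraint is met with equality rather than with slack.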
Finally, we show that the filter in Corollary 3 corresponds
to the Schalkwijk and Kailath scheme [31], [32]. The transfer
function $F(z) = X(z)/Y(z)$ in Corollary 3 can be represented in
the time domain as follows,

$$X_i = \beta X_{i-1} - \frac{\beta^2 - 1}{\beta} Y_{i-1}$$

or

$$X_i = \beta \left( X_{i-1} - \frac{\beta^2 - 1}{\beta^2} Y_{i-1} \right). \tag{25}$$

As was shown earlier, for the transfer function $F(z)$ in
Corollary 3 we have $E(X_i^2) = P$. Hence, $E(Y_i^2) = 1 + P$
and

$$\frac{\beta^2 - 1}{\beta^2} = \frac{P}{1 + P}.$$

Therefore, (25) can be written as

$$X_i = \beta \left( X_{i-1} - \frac{P}{1 + P} Y_{i-1} \right) \tag{26}$$

which is the recursive representation of the Schalkwijk and
Kailath encoder [33].
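A short Monte Carlo run can confirm that this recursion keeps the transmit power at $P$. The sketch below is an illustration under assumptions made here: the recursion is taken to be $X_i = \beta\,(X_{i-1} - \tfrac{P}{1+P} Y_{i-1})$ with $Y_i = X_i + Z_i$, the first symbol is the centered message scaled so that $E(X_1^2) = P$, and the function name and sample sizes are hypothetical.

```python
import numpy as np

def simulate_recursion(P, n, num_runs=100_000, seed=0):
    # Returns the empirical power E(X_i^2) for i = 1, ..., n.
    rng = np.random.default_rng(seed)
    beta = np.sqrt(1.0 + P)
    m = rng.uniform(0.0, 1.0, size=num_runs)   # M ~ Unif(0, 1)
    x = np.sqrt(12.0 * P) * (m - 0.5)          # zero mean, variance P (assumed start)
    powers = []
    for _ in range(n):
        powers.append(np.mean(x ** 2))
        y = x + rng.standard_normal(num_runs)  # AWGN channel with unit noise power
        x = beta * (x - P / (1.0 + P) * y)     # recursive encoder update
    return np.array(powers)

if __name__ == "__main__":
    print(simulate_recursion(P=2.0, n=10))     # every entry should be near 2.0
```

The power stays at $P$ because the term subtracted in each step is the linear MMSE estimate of $X_{i-1}$ from $Y_{i-1}$, so the remaining error has variance $P/(1+P)$ before it is scaled by $\beta^2 = 1 + P$.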

V. Gaussian Multiple Access Channel 

In this section, we extend the control representation for
the communication over point-to-point channels to the feedback
communication of $N$ senders and one receiver over the
AWGN-MAC depicted in Fig. 2a. Each sender $j \in \{1, \ldots, N\}$
wishes to reliably transmit a message $M_j \in \mathcal{M}_j := (0, 1)$ in
the unit interval to the receiver. At each time $i$, the output of
the channel is

$$Y_i = \sum_{j=1}^{N} X_{ji} + Z_i$$

where $X_{ji} \in \mathbb{R}$ is the symbol transmitted by sender $j$ at time $i$,
$Y_i \in \mathbb{R}$ is the output of the channel, and $\{Z_i\}$ is a discrete-time
zero-mean white Gaussian noise process with unit average
power, i.e., $E(Z_i^2) = 1$, independent of $M_1, \ldots, M_N$. We
assume that the output symbols are causally fed back to the senders
and the transmitted symbol $X_{ji}$ for sender $j$ at time $i$ can
depend on both the message $M_j$ and the previous channel
output sequence $Y^{i-1} := \{Y_1, Y_2, \ldots, Y_{i-1}\}$. We define an
n-code for the AWGN-MAC as

1) N encoders: each encoder $j$, $j = 1, \ldots, N$, is a set of
encoding maps $f_{ji} : \mathcal{M}_j \times \mathcal{Y}^{i-1} \to \mathcal{X}_j$, $i = 1, \ldots, n$,
also known to the receiver, such that

$$X_{ji} = f_{ji}(M_j, Y^{i-1}) \tag{27}$$

and

2) a decoder: a decoding map $\phi : \mathcal{Y}^n \to \mathcal{M}_1 \times \cdots \times \mathcal{M}_N$
which determines the estimates of the messages
$M_1, \ldots, M_N$ given $Y^n$,

$$(\hat{M}_1, \ldots, \hat{M}_N) = \phi(Y^n). \tag{28}$$

We assume that the message vector $\mathbf{M} := (M_1, \ldots, M_N) \sim \mathrm{Unif}((0, 1)^N)$
is uniformly distributed in an N-dimensional
unit box and the performance measure is the set of mean
square errors,

$$D_j^{(n)} := E\big((M_j - \hat{M}_j(Y^n))^2\big), \quad j = 1, \ldots, N.$$



The set of MSE exponents $(E_1, \ldots, E_N)$ is called achievable if
there exists a sequence of n-codes such that for $j = 1, \ldots, N$,

$$\liminf_{n \to \infty} -\frac{1}{2n} \log\big(D_j^{(n)}\big) \ge E_j.$$

We say that a sequence of n-codes has asymptotic powers
$(P_1, \ldots, P_N)$ if

$$\lim_{n \to \infty} E(X_{jn}^2) = P_j, \quad j \in \{1, \ldots, N\}.$$

Similar to the point-to-point case, the information-theoretic
achievable rates [28] are defined based on a sequence of
message sets whose size depends on the block length $n$.
Consider the discrete message sets $\mathcal{M}_{jn} := \{1, 2, \ldots, 2^{nR_j}\}$,
$j = 1, \ldots, N$. For an n-code with messages $M_{jn} \sim \mathrm{Unif}(\mathcal{M}_{jn})$,
$j = 1, \ldots, N$, uniformly distributed over the sets $\mathcal{M}_{jn}$, let the
probability of error be

$$P_e^{(n)} := P\{(\hat{M}_{1n}, \ldots, \hat{M}_{Nn}) \ne (M_{1n}, \ldots, M_{Nn})\}.$$

The set of rates $(R_1, \ldots, R_N)$ is called achievable under block
power constraints $(P_1, \ldots, P_N)$ if there exists a sequence of
n-codes with messages $M_{jn} \sim \mathrm{Unif}(\mathcal{M}_{jn})$, such that for every
block length $n$ and $j \in \{1, \ldots, N\}$,

$$\sum_{i=1}^{n} E(X_{ji}^2) \le n P_j \tag{29}$$

and $P_e^{(n)} \to 0$ as $n \to \infty$. We refer to $R = \sum_{j=1}^{N} R_j$ as the
sum rate of a given code.

The following lemma presents the connection between the
achievable MSE exponents and the achievable rates.

Lemma 4: Let MSE exponents $(E_1, \ldots, E_N)$ with asymptotic
powers $(P'_1, \ldots, P'_N)$ be achievable and

$$R_j < E_j, \quad P'_j < P_j$$

for $j = 1, \ldots, N$. Then the rate-tuple $(R_1, \ldots, R_N)$ is also
achievable and the block power constraints $(P_1, \ldots, P_N)$ are
asymptotically satisfied.

Proof: Let the MSE exponents $(E_1, \ldots, E_N)$ be achievable.
By a similar argument as in Lemma 2 we can see that
the rates $(R_1, \ldots, R_N)$ are also achievable if

$$R_j < E_j, \quad j = 1, \ldots, N.$$

Moreover, if $P'_j = \lim_{n \to \infty} E(X_{jn}^2)$, $j = 1, \ldots, N$, and $P'_j < P_j$,
then using the Cesàro sum we can see that the constraint

$$\frac{1}{n} \sum_{i=1}^{n} E(X_{ji}^2) \le P_j, \quad j \in \{1, \ldots, N\}$$

is satisfied for sufficiently large $n$. ■
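The Cesàro argument in the proof can be illustrated numerically. In the sketch below the per-symbol power sequence $1 + 5/i$ is a made-up example, chosen only because it converges to a limit (here 1) that lies strictly below the block constraint (here 1.1); the function name `running_average_power` is hypothetical.

```python
from itertools import accumulate

def running_average_power(per_symbol_power):
    # (1/n) * sum_{i <= n} E(X_i^2) for n = 1, 2, ..., len(per_symbol_power).
    sums = accumulate(per_symbol_power)
    return [s / n for n, s in enumerate(sums, start=1)]

if __name__ == "__main__":
    powers = [1.0 + 5.0 / i for i in range(1, 1001)]  # converges to 1.0
    averages = running_average_power(powers)
    print(averages[9], averages[499], averages[999])  # first exceeds 1.1, later ones do not
```

Early block lengths may violate the constraint, but once the running average has settled close enough to the limit, every longer block satisfies it, which is what "asymptotically satisfied" means in Lemma 4.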

A. LQG Approach for the AWGN-MAC 

In the following we show that codes can be designed for
the AWGN-MAC with feedback, based on the LTI systems
controlled over a point-to-point AWGN channel depicted
in Fig. 2b. We refer to these codes as LTI codes for the
AWGN-MAC.
Fig. 2. The set of N encoders for feedback communication over the AWGN-MAC depicted in (a) can be represented as a dynamical system depicted in (b). Panels: (a) The AWGN-MAC with feedback; (b) Control over AWGN channel.



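Let the state $S_i \in \mathbb{C}^N$ be a complex vector of length $N$,
and the system dynamics be

$$S_i = A S_{i-1} + B Y_{i-1}, \quad i = 1, \ldots, n \tag{30}$$

where

$$A = \mathrm{diag}(\beta_1 \omega_1, \ldots, \beta_N \omega_N), \quad B = (1, \ldots, 1)^T \tag{31}$$

such that $\beta_j > 1$ are real and $|\omega_j| = 1$ are distinct points on
the unit circle of the complex plane. As is clear from the
system dynamics given in (30), the dependence of the state on
the channel outputs is causal.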

The controller is assumed to control each mode $j$ separately and to transmit a combined scalar control signal $X_i$ as follows. It observes the current state and picks a vector

$$\mathbf{X}_i = (X_{1i}, \ldots, X_{Ni})' \tag{32}$$

where

$$X_{ji} = \pi_{ji}(S_i(j)) \tag{33}$$

can depend only on $S_i(j)$, and then transmits the control signal

$$X_i = \sum_{j=1}^{N} X_{ji}. \tag{34}$$



For the purpose of analysis we assume complex transmissions over a complex channel,

$$Y_i = X_i + Z_i, \qquad Z_i \sim \mathcal{CN}(0, 1) \ \text{i.i.d.} \tag{35}$$

where $X_i, Y_i \in \mathbb{C}$ are the input and output of the channel, respectively, and $\{Z_i\}$ is a discrete-time zero-mean white complex Gaussian noise process with identity covariance. Note that one transmission over this complex channel can be viewed as two transmissions over the real channel. Hence, the rates achievable per complex dimension are also achievable over the real channel.

We assume the complex messages $M_j \in \mathbb{C}$, $j = 1, \ldots, N$, in the AWGN-MAC are uniform over $(0, 1) \times (0, 1)$ and we set the initial state $S_0 \in \mathbb{C}^N$ of the system as

$$S_0 = M = (M_1, \ldots, M_N).$$

Given the system (30) and the controller (34), we derive the LTI $n$-code for the AWGN-MAC as follows.

Definition 3: An LTI $n$-code for the AWGN-MAC based on the LTI system (30) and the controller (34) consists of

1) Encoding mappings: Encoder $j$ recursively forms

$$S_i(j) = \beta_j\omega_j S_{i-1}(j) + Y_{i-1}, \quad i = 1, \ldots, n \tag{36}$$

and at time $i$ transmits $X_{ji} = \pi_{ji}(S_i(j))$.
2) Decoding: At the end of the block, the decoder forms the estimate vector $\hat{M}_n := -A^{-n}\hat{S}_n$, where

$$\hat{S}_i = A\hat{S}_{i-1} + B Y_{i-1}, \quad \hat{S}_0 = 0, \ Y_0 = 0 \tag{37}$$

and picks $\hat{M}_n(j)$ as the estimate of the message $M_j$.
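As a rough illustration of Definition 3, the following sketch simulates one block of an LTI code with a linear per-mode policy and checks numerically that the decoding error equals $A^{-n}S_n$. The function name `run_lti_code`, the linear policy $X_{ji} = -c_j S_i(j)$, the convention $Y_0 = 0$, and all numeric values are assumptions made for this example only.

```python
import math
import random


def run_lti_code(message, betas, omegas, gains, n, rng):
    """Simulate one block of a linear LTI n-code; return (estimates, final states)."""
    num_users = len(message)
    a = [betas[j] * omegas[j] for j in range(num_users)]  # diagonal of A in (31)
    s = list(message)            # encoder states, S_0 = M
    s_hat = [0j] * num_users     # decoder states, S_hat_0 = 0
    y_prev = 0j                  # assumed convention Y_0 = 0
    for _ in range(n):
        # Updates (36) and (37): every mode is driven by the same channel output.
        s = [a[j] * s[j] + y_prev for j in range(num_users)]
        s_hat = [a[j] * s_hat[j] + y_prev for j in range(num_users)]
        # Encoder j uses only its own mode; the channel adds the N signals.
        x = sum(-gains[j] * s[j] for j in range(num_users))
        z = complex(rng.gauss(0.0, math.sqrt(0.5)), rng.gauss(0.0, math.sqrt(0.5)))
        y_prev = x + z
    estimates = [-s_hat[j] / a[j] ** n for j in range(num_users)]
    return estimates, s


if __name__ == "__main__":
    beta = 1.5
    m = [complex(0.3, 0.7)]
    est, s_n = run_lti_code(m, [beta], [1 + 0j], [(beta**2 - 1) / beta], 20,
                            random.Random(0))
    print("message:", m[0], "estimate:", est[0])
    print("error:", m[0] - est[0], "A^-n S_n:", s_n[0] / beta**20)
```

Because the encoder and decoder recursions are driven by the same outputs, the two printed error expressions agree up to rounding for any choice of gains; stability of the closed loop only determines how fast the error shrinks.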
Note that to design codes for the MAC it is necessary that the matrix $A$ is diagonal as in (31) and that the control signal is of the form (34), since the encoders in the MAC are decentralized and do not have access to each other's messages. In the code described above, the $j$-th mode of the system, $S_i(j)$, represents the information based on which encoder $j$ picks the transmitted signal $X_{ji}$ at time $i$. We say that the controller stabilizes the system if

$$\limsup_{n\to\infty} E(|S_n(j)|^2) < \infty, \quad j = 1, \ldots, N.$$

Lemma 5: If the controller of the form (34) stabilizes the linear system in (30), then the corresponding sequence of $n$-codes for the AWGN-MAC with feedback described in (36) and (37) achieves MSE exponents

$$E_j = \log(\beta_j), \quad j = 1, \ldots, N.$$

Proof: See Appendix D.

Remark 5: Note that the set $\{\beta_j\}$ depends only on the system¹ and not on the controller. This means that any stabilizing controller for the system (30) can be used to design a code for the AWGN-MAC, which achieves the same set of MSE exponents $(\log(\beta_1), \ldots, \log(\beta_N))$ or, according to Lemma 2, the same set of rates.

In the following, we assume that the system is fixed as in (30) and we find the stationary controller which minimizes the asymptotic power

$$P := \lim_{n\to\infty} \frac{1}{n}\sum_{i=1}^{n} E(|X_i|^2). \tag{38}$$



As we show below, the code for the AWGN-MAC which is 
based on this optimal controller is also optimal in terms of 
sum rate among the linear codes for the AWGN-MAC under 
equal power constraints. 

For a controller of the form (34), let $K_{\mathbf{X}_i} := \mathrm{Cov}(\mathbf{X}_i)$ be the covariance of $\mathbf{X}_i$ given in (32). We say that $K_{\mathbf{X}}$ is the asymptotic covariance if

$$\lim_{i\to\infty} K_{\mathbf{X}_i} = K_{\mathbf{X}}.$$

Note that by (34), the asymptotic power of the controller can be written as $P = \sum_{j,k=1}^{N} (K_{\mathbf{X}})_{jk}$. Hence, $P$ represents the asymptotic combined power of all the senders in the corresponding code for the AWGN-MAC, which also captures the correlation between the transmitted signals. In contrast, the asymptotic power of each sender in the AWGN-MAC is $P_j = (K_{\mathbf{X}})_{jj}$, $j = 1, \ldots, N$.

Lemma 6: The optimal controller of the form (34) which minimizes the cost (38) is stationary and linear, i.e.,

$$X_i = -C S_i,$$

where

$$C = (B'GB + 1)^{-1} B'GA \tag{39}$$

and $G$ is the solution to the following discrete algebraic Riccati equation (DARE)

$$G = A'GA - A'GB(B'GB + 1)^{-1}B'GA. \tag{40}$$



Proof: Note that the optimal power $P^*$ for the controllers of the form (34) can be lower bounded as

$$P^* \ge P_{\min}$$

where $P_{\min}$ is the optimal cost considering a more general controller of the form

$$X_i = \pi_i(S_i) \tag{41}$$

which picks the scalar action $X_i$ based on the complete state $S_i$ and is not necessarily separated for different modes as in (34). Considering the general control (41), we have a linear quadratic Gaussian (LQG) control problem. From the theory of LQG control [34] we know that the optimal control is linear, i.e.,

$$X_i = -C_i S_i, \quad C_i = [c_{1i} \ c_{2i} \ \cdots \ c_{Ni}] \tag{42}$$

and the stationary linear control $C = \lim_{i\to\infty} C_i$, which minimizes the asymptotic cost (38), is given by $C$ in (39). Since the solution to this LQG problem given in (42) is of the form (34), with $\pi_{ji}(S_i(j)) = -c_{ji}S_i(j)$, we conclude that

$$P^* = P_{\min}$$

and the optimal controller of the form (34) is the same as the optimal controller of the LQG problem. ■
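For intuition, the DARE (40) can be solved by hand in the single-user case $N = 1$ with $A = \beta$ and $B = 1$, where the positive solution is $G = \beta^2 - 1$ and the gain (39) is $C = (\beta^2 - 1)/\beta$. The sketch below recovers these values by fixed-point iteration of the Riccati map; the function names and the choice of starting point are assumptions of this example, not part of the paper.

```python
def riccati_step(g, beta):
    """Right-hand side of the DARE (40) for the scalar system A = beta, B = 1."""
    return beta**2 * g - (beta * g) ** 2 / (g + 1.0)


def solve_scalar_dare(beta, iterations=200):
    """Iterate the Riccati map from a positive start and return (G, C)."""
    g = 1.0  # g = 0 is also a fixed point, but it yields the non-stabilizing gain C = 0
    for _ in range(iterations):
        g = riccati_step(g, beta)
    c = g * beta / (g + 1.0)  # gain (39) specialized to N = 1
    return g, c


if __name__ == "__main__":
    beta = 1.5
    g, c = solve_scalar_dare(beta)
    print("G:", g, "C:", c, "closed-loop pole:", beta - c)
```

In this scalar case the closed-loop pole $\beta - C$ equals $1/\beta$, so the unstable mode is reflected inside the unit circle.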
By Lemma 6 the optimal controller of the form (34) is linear. The following theorem provides rate and power analysis for the AWGN-MAC code based on a general stationary linear control of the form

$$X_i = -C S_i, \quad C = [c_1 \ c_2 \ \cdots \ c_N]. \tag{43}$$

Theorem 3: Consider the stationary linear controller (43) for the system (30). If $A - BC$ is stable, i.e., the eigenvalues of $A - BC$ are inside the unit circle, then the corresponding code for the AWGN-MAC designed based on this linear control achieves the rates

$$R_j < E_j = \log(\beta_j), \quad j = 1, \ldots, N$$

with asymptotic powers

$$P_j = |c_j|^2 K_{jj}, \quad j = 1, \ldots, N \tag{44}$$

where $K$ is the unique solution of the following discrete algebraic Lyapunov equation (DALE)

$$K = (A - BC)K(A - BC)' + Q, \quad Q = BB'. \tag{45}$$

Proof: First note that by Lemma 5 the MSE exponents $E_j = \log(\beta_j)$, $j = 1, \ldots, N$, are achievable and hence, by Lemma 4, any rates $R_j < E_j$, $j = 1, \ldots, N$, are achievable in each complex dimension.

To find the asymptotic powers, note that given a stationary linear controller of the form (43), the closed-loop system given in (30) can be written as

$$S_i = (A - BC)S_{i-1} + BZ_{i-1}. \tag{46}$$



Consider the code corresponding to the linear control (43), where encoder $j$ transmits

$$X_{ji} = -c_j S_i(j)$$

at time $i$, and let $K_i := \mathrm{Cov}(S_i)$. Then, the asymptotic power $P_j$ is

$$P_j = \lim_{n\to\infty} \frac{1}{n}\sum_{i=1}^{n} E(|X_{ji}|^2) = \lim_{n\to\infty} \frac{1}{n}\sum_{i=1}^{n} |c_j|^2 (K_i)_{jj}. \tag{47}$$

¹ From now on, the word system refers to the one given in (30).

From the closed-loop dynamics given in (46) we have the following Lyapunov recursion,

$$K_n = (A - BC)K_{n-1}(A - BC)' + Q, \quad Q = BB'.$$

If $A - BC$ is stable, we know [34] that

$$\lim_{n\to\infty} K_n = K \succ 0, \tag{48}$$

where $K$ is the unique positive-definite solution of the corresponding discrete algebraic Lyapunov equation (DALE)

$$K = (A - BC)K(A - BC)' + Q.$$

Therefore, by (48) and the Cesàro mean theorem, the asymptotic power in (47) becomes $P_j = |c_j|^2 K_{jj}$. ■
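The power formula of Theorem 3 is easy to evaluate in the scalar case $N = 1$, $A = \beta$, $B = 1$, where the Lyapunov recursion reads $K_n = (\beta - c)^2 K_{n-1} + 1$. The following sketch, with the hypothetical helper `asymptotic_power` and illustrative gains, iterates this recursion and returns $|c|^2 K$. For the gain $c = (\beta^2 - 1)/\beta$ the result is $\beta^2 - 1$, while other stabilizing gains support the same exponent $\log \beta$ at a higher power.

```python
def asymptotic_power(beta, c, iterations=2000):
    """Iterate K_n = (beta - c)^2 K_{n-1} + 1 and return |c|^2 K (Theorem 3, N = 1)."""
    pole = beta - c
    if abs(pole) >= 1.0:
        raise ValueError("A - BC must be stable")
    k = 0.0  # the limit does not depend on the initial covariance
    for _ in range(iterations):
        k = pole**2 * k + 1.0
    return c**2 * k


if __name__ == "__main__":
    beta = 1.5
    for gain in ((beta**2 - 1) / beta, 1.0, beta):
        print("gain:", gain, "asymptotic power:", asymptotic_power(beta, gain))
```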

B. Kramer Code vs. Optimal LTI Code 
Now consider the symmetric system

$$\beta_j = \beta, \quad j = 1, \ldots, N \tag{49}$$

and let the optimal LTI code for the AWGN-MAC be the code based on the optimal control given in Lemma 6. The following lemma characterizes the matrix $G$ for the symmetric choice of matrix $A$ given in (49).

Lemma 7: Let $\beta_j = \beta$ and $\omega_j = e^{2\pi\sqrt{-1}\,(j-1)/N}$. The unique positive-definite solution $G \succ 0$ to the DARE (40) is circulant with real eigenvalues satisfying $\lambda_i = \frac{1}{\beta^2}\lambda_{i-1}$ for $i = 2, \ldots, N$. The largest eigenvalue $\lambda_1$ satisfies

$$1 + N\lambda_1 = \beta^{2N} \tag{50}$$

$$1 + \lambda_1\left(N - \frac{\lambda_1}{G_{jj}}\right) = \beta^{2(N-1)}. \tag{51}$$

Proof: See Appendix E.
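The statements of Lemma 7 can be checked numerically. The sketch below iterates the Riccati map of (40) for the symmetric system, using the fact that $A$ is diagonal and $B$ is the all-ones vector, and then compares $1 + B'GB$ with $\beta^{2N}$ and the diagonal of $G$ with the value implied by (50) and (51). The function names, the identity starting point, and the iteration count are assumptions of this example; it is a numerical sanity check, not a proof.

```python
import cmath


def symmetric_dare(beta, num_users, iterations=500):
    """Iterate G <- A'GA - A'GB (1 + B'GB)^{-1} B'GA with A = beta * diag(omega_j)."""
    n = num_users
    a = [beta * cmath.exp(2j * cmath.pi * j / n) for j in range(n)]
    g = [[(1.0 if j == k else 0.0) + 0j for k in range(n)] for j in range(n)]
    for _ in range(iterations):
        v = [sum(row) for row in g]   # G B for the all-ones vector B
        s = sum(v).real               # B' G B
        g = [[a[j].conjugate() * a[k] * (g[j][k] - v[j] * v[k].conjugate() / (1.0 + s))
              for k in range(n)] for j in range(n)]
    return g


def predicted_diagonal(beta, num_users):
    """G_jj obtained by solving (50) for lambda_1 and (51) for G_jj."""
    lam1 = (beta ** (2 * num_users) - 1.0) / num_users
    return lam1**2 / (beta ** (2 * num_users - 2) * (beta**2 - 1.0))


if __name__ == "__main__":
    beta, n = 1.2, 3
    g = symmetric_dare(beta, n)
    total = sum(sum(row) for row in g).real
    print("1 + B'GB:", 1.0 + total, "beta^(2N):", beta ** (2 * n))
    print("G_11:", g[0][0].real, "predicted:", predicted_diagonal(beta, n))
```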

The following theorem shows that under equal power constraints for all the senders, the optimal LTI code for the system (49) corresponds to the linear code proposed by Kramer [7], which is known to be sum rate optimal within the class of generalized linear feedback codes [8].

Theorem 4: Let $R(P)$ be the sum rate achieved by Kramer's code with equal power $P$. There exists a $\beta > 1$ such that the optimal LTI code based on the system (49) achieves the sum rate $R(P)$ and satisfies the equal power constraint $P$ asymptotically.

Proof: By Theorem 3 the optimal LTI code based on the symmetric system (49) and the optimal controller in (39) achieves the symmetric MSE exponent $E$, i.e., $E_j = E = \log(\beta)$, $j = 1, \ldots, N$, with asymptotic powers $P_j = |c_j|^2 K_{jj}$, $j = 1, \ldots, N$, where $K$ is given in (45). Moreover, by Lemma 4 the rates $R_j < \log(\beta)$, $j = 1, \ldots, N$, are achievable and therefore the sum rate

$$R < N\log(\beta) \tag{52}$$

is achievable.

On the other hand, let $R(P)$ denote the sum rate achievable by Kramer's code under symmetric per-symbol power $P$. From [7] we have

$$R(P) < \tfrac{1}{2}\log(1 + NP\phi(P)) \tag{53}$$

where $\phi(P) \in \mathbb{R}$ is the unique solution in the interval $[1, N]$ to

$$(1 + NP\phi)^{N-1} = (1 + P\phi(N - \phi))^{N}. \tag{54}$$

Therefore, by picking $\beta$ such that

$$R(P) < N\log(\beta) < \tfrac{1}{2}\log(1 + NP\phi(P)) \tag{55}$$

the LTI code can achieve sum rate equal to $R(P)$.

It remains to show that the asymptotic powers in the optimal LTI code satisfy

$$P_j < P, \quad j = 1, \ldots, N. \tag{56}$$

Towards this end, we first show that $P_j = G_{jj}$, where $G$ is the solution to (40). Note that the DARE given in (40) can be equivalently written as the following discrete algebraic Lyapunov equation (DALE)

$$G = (A - BC)'G(A - BC) + C'C \tag{57}$$

and, comparing with (45), the diagonal elements of $G$ and $K$ can be related as follows

$$G_{jj} = |c_j|^2 K_{jj}. \tag{58}$$

Besides, from (44) we know $P_j = |c_j|^2 K_{jj}$. Hence, we conclude that the diagonal elements of $G$ represent the asymptotic powers in the optimal LTI code,

$$P_j = G_{jj}. \tag{59}$$

Now we show that if $\beta$ satisfies (55), then $G_{jj} < P$ for the corresponding $G$. First, note that from (50) and (51) we have

$$(1 + N\lambda_1)^{N-1} = \left(1 + \lambda_1\left(N - \frac{\lambda_1}{G_{jj}}\right)\right)^{N}. \tag{60}$$

Comparing with (54) and noting that $G$ is circulant, we get

$$\lambda_1 = G_{jj}\,\phi(G_{jj}), \quad j = 1, \ldots, N. \tag{61}$$

Now if (55) holds, from (50) we know

$$\tfrac{1}{2}\log(1 + N\lambda_1) = N\log(\beta) < \tfrac{1}{2}\log(1 + NP\phi(P))$$

and hence

$$\lambda_1 < P\phi(P). \tag{62}$$

From (61) and (62) we have $G_{jj}\,\phi(G_{jj}) < P\phi(P)$, and $G_{jj} < P$ follows from the monotonicity of $\phi(P)$ in $P$. Combined with (59), (56) then follows. ■
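To make the comparison in the proof concrete, the sketch below solves (54) for $\phi(P)$ by bisection on $[1, N]$, computes the boundary value of $\beta$ at which $N\log(\beta) = \tfrac{1}{2}\log(1 + NP\phi(P))$, and evaluates the per-sender power $G_{jj}$ implied by (50) and (51). At this boundary the power comes out equal to $P$; any smaller $\beta$, as required by the strict inequality in (55), gives a smaller power. The function names and numeric inputs are illustrative assumptions, and the bisection relies on the uniqueness of the root stated above.

```python
import math


def kramer_phi(power, num_users, iterations=100):
    """Bisection for the root of (54) in the interval [1, N]."""
    def gap(phi):
        left = (num_users - 1) * math.log(1 + num_users * power * phi)
        right = num_users * math.log(1 + power * phi * (num_users - phi))
        return left - right

    lo, hi = 1.0, float(num_users)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if gap(mid) > 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def boundary_beta(power, num_users):
    """beta with beta^(2N) = 1 + N P phi(P), the upper limit allowed by (55)."""
    phi = kramer_phi(power, num_users)
    return (1 + num_users * power * phi) ** (1.0 / (2 * num_users))


def lti_power(beta, num_users):
    """G_jj obtained by solving (50) for lambda_1 and (51) for G_jj."""
    lam1 = (beta ** (2 * num_users) - 1.0) / num_users
    return lam1**2 / (beta ** (2 * num_users - 2) * (beta**2 - 1.0))


if __name__ == "__main__":
    p, n = 1.0, 2
    b = boundary_beta(p, n)
    print("phi:", kramer_phi(p, n), "beta:", b, "power:", lti_power(b, n))
    print("power for a smaller beta:", lti_power(0.99 * b, n))
```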
VI. Conclusion

Communication and control problems have different goals, 
but they both deal with information dynamics and in many 
cases they share similar formulations which can be tackled by 
tools and techniques developed in both fields. Understanding 
the interface between these two theories has become even 
more important in the last decade, as we have witnessed 
technological advancements leading to the convergence of 
computing, communication, and control over networked plat- 
forms of embedded systems. This paper attempted, ad rem, 
one step in this direction. 

Appendix A
Proof of Lemma 1

Note that without loss of generality we can decompose the encoding functions $f_i$ into two steps as follows. First, based on the past output $Y^{i-1}$ the encoder picks a function

$$\tilde{f}_i : \mathcal{M} \to \mathcal{X}. \tag{63}$$

Then it transmits

$$X_i = \tilde{f}_i(M).$$

With this separation, the encoder (controller) can be defined by the set of mappings

$$\pi_i : \mathcal{Y}^{i-1} \to \{\tilde{f}\}$$

where $\{\tilde{f}\}$ is the set of all functions of the form (63). Note that this new encoder $\{\pi_i\}_{i=1}^{n}$ only observes the past output $Y^{i-1}$ and not the message $M$. Therefore, one can view the feedback communication as a control problem where the state is $M$ and the controller has partial observations $Y^{i-1}$. Based on standard results in stochastic control [35, Chapter 6], there is no loss of optimality if the mapping $\pi_i$ is picked only according to the conditional distribution of $M$ given $Y^{i-1}$. Therefore, to find $X_i$ it is sufficient to have the message $M$ and its posterior distribution given $Y^{i-1}$.

Appendix B
Proof of Lemma 2

Consider a sequence of $n$-codes for $M \in (0, 1)$ such that

$$E \le \liminf_{n\to\infty} -\frac{1}{2n}\log(D^{(n)}). \tag{64}$$

We show that a rate $R < E$ is achievable. Consider

$$P\left(|M - \hat{M}_n| > \tfrac{1}{2} \cdot 2^{-nR}\right) \le 4 \cdot 2^{2nR} \cdot D^{(n)} \tag{65}$$

$$\le 4 \cdot 2^{2nR} \cdot 2^{-2n(E - \epsilon_n)} \tag{66}$$

$$= 4 \cdot 2^{-2n(E - R - \epsilon_n)} \tag{67}$$

for some $\epsilon_n$ where $\epsilon_n \to 0$ as $n \to \infty$. Here (65) follows from the Chebyshev inequality and (66) follows from (64). If $R < E$, from (67) we have

$$P\left(|M - \hat{M}_n| > \tfrac{1}{2} \cdot 2^{-nR}\right) \to 0$$

as $n \to \infty$, and the decoder can pick intervals $A_n$ such that

$$|A_n| = 2^{-nR}$$

and

$$P(M \notin A_n) \to 0 \quad \text{as } n \to \infty. \tag{68}$$

By Lemma II.3 in [27], if condition (68) is satisfied then for all $n$ we can map the message set $\mathcal{M}_n = \{1, \ldots, 2^{nR}\}$ into a set of message points in the unit interval $(0, 1)$ such that the minimum distance between any two message points is $2^{-nR}$. Therefore, using the interval decoder described above, which satisfies (68), $P_e^{(n)} \to 0$ as $n \to \infty$ and the rate $R$ is achievable.

To complete the proof, let rate $R$ be achievable by a given sequence of $n$-codes such that $\lim_{n\to\infty} P_e^{(n)} / 2^{-2nR} = 0$. For each $n$, we divide the unit interval $(0, 1)$ into $2^{nR}$ equal sub-intervals and map the continuous message $M \in (0, 1)$ to the discrete message $M_n \in \mathcal{M}_n = \{1, \ldots, 2^{nR}\}$ according to the sub-interval $M$ lies in. To communicate $M$, we send the corresponding $M_n$ using the given $n$-code, and we pick the middle point of the interval corresponding to the decoded message $\hat{M}_n$ as the estimate of $M \in (0, 1)$. The MSE for $M$ can be (loosely) upper bounded by $2^{-2nR}$ if the message $M_n$ is decoded correctly, and by 1 in case of an error. Hence,

$$D^{(n)} \le P_e^{(n)} + (1 - P_e^{(n)})\,2^{-2nR}. \tag{69}$$

From (69) and the assumption $\lim_{n\to\infty} P_e^{(n)} / 2^{-2nR} = 0$ we have

$$\liminf_{n\to\infty} -\frac{1}{2n}\log D^{(n)} \ge R$$

and the MSE exponent $E = R$ is achievable.
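The quantization step in the second half of the proof can be sketched as follows, assuming base-2 logarithms and an integer $nR$. The helper names `midpoint_estimate`, `mse_bound`, and `exponent` and the sample error probability are assumptions of this example. It checks that the midpoint estimate is within the loose bound $2^{-2nR}$ and evaluates the exponent implied by the right-hand side of (69).

```python
import math


def midpoint_estimate(m, n, rate):
    """Quantize m in (0, 1) into 2^{nR} equal bins and return the bin midpoint."""
    bins = 2 ** round(n * rate)  # assumes nR is an integer
    index = min(int(m * bins), bins - 1)
    return (index + 0.5) / bins


def mse_bound(p_error, n, rate):
    """Right-hand side of (69)."""
    return p_error + (1.0 - p_error) * 2.0 ** (-2 * n * rate)


def exponent(mse, n):
    """-(1/2n) log2 of the MSE, the quantity whose liminf defines the MSE exponent."""
    return -math.log2(mse) / (2 * n)


if __name__ == "__main__":
    n, rate = 50, 0.5
    m = 0.3141592
    err = m - midpoint_estimate(m, n, rate)
    print("squared error:", err**2, "loose bound:", 2.0 ** (-2 * n * rate))
    # An error probability that vanishes faster than 2^{-2nR}, as the proof assumes.
    p_error = 2.0 ** (-3 * n * rate)
    print("exponent from (69):", exponent(mse_bound(p_error, n, rate), n))
```

When the error probability decays faster than $2^{-2nR}$, the second term of (69) dominates and the computed exponent is close to $R$.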

Appendix C
Proof of Lemma 3

We show that

$$h(Y^n) \ge h(Z^n) + I(S_0; Y^n)$$

with equality if and only if $X_i$ is a function of $(S_0, Y^{i-1})$ for $i = 1, \ldots, n$. Consider

$$\begin{aligned}
h(Y^n) &= h(Y^n \mid S_0) + I(S_0; Y^n) \\
&= \sum_{i=1}^{n} h(Y_i \mid Y^{i-1}, S_0) + I(S_0; Y^n) \\
&\ge \sum_{i=1}^{n} h(Y_i \mid Y^{i-1}, X^i, S_0) + I(S_0; Y^n) && (70) \\
&= \sum_{i=1}^{n} h(Y_i - X_i \mid Y^{i-1}, X^i, S_0) + I(S_0; Y^n) && (71) \\
&= \sum_{i=1}^{n} h(Z_i \mid Y^{i-1}, X^i, S_0) + I(S_0; Y^n) \\
&= \sum_{i=1}^{n} h(Z_i \mid Y^{i-1}, X^i, S_0, Z^{i-1}) + I(S_0; Y^n) && (72) \\
&= \sum_{i=1}^{n} h(Z_i \mid Z^{i-1}) + I(S_0; Y^n) && (73) \\
&= h(Z^n) + I(S_0; Y^n) && (74)
\end{aligned}$$

where

• the inequality (70) comes from the fact that conditioning reduces entropy, and equality holds iff $Y_i \to (S_0, Y^{i-1}) \to X^i$ form a Markov chain for $i = 1, \ldots, n$, and since $Y_i = X_i + Z_i$ this Markov chain holds iff $X_i$ is a deterministic function of $(S_0, Y^{i-1})$,

• the equalities (71) and (72) follow from the fact that the channel is invertible, i.e., $Z_i = Y_i - X_i$, and

• the equality (73) follows since $Z_i \to Z^{i-1} \to (Y^{i-1}, X^i, S_0)$ form a Markov chain.

Appendix D
Proof of Lemma 5

The system dynamics given in (30) can be rewritten as

$$S_i = A^i S_0 + \hat{S}_i \tag{75}$$

where $\hat{S}_i$ is given in (37). Plugging $i = n$ and $S_0 = M$ into (75) and multiplying both sides by $A^{-n}$ we have

$$A^{-n} S_n = M + A^{-n}\hat{S}_n = M - \hat{M}_n$$

where $\hat{M}_n = -A^{-n}\hat{S}_n$ is the estimate of the message (see Definition 3). The covariance matrix of the error vector $e_n := M - \hat{M}_n = A^{-n} S_n$ can be written as

$$\mathrm{Cov}(e_n) = A^{-n} K_n (A^{-n})'$$

where $K_n := \mathrm{Cov}(S_n)$ is the covariance matrix of $S_n$. Therefore, the MSE for sender $j$ is

$$D_j^{(n)} = E(|e_n(j)|^2) = \beta_j^{-2n} (K_n)_{jj}. \tag{76}$$

By the assumption of stability, $\limsup_{n\to\infty} (K_n)_{jj} < \infty$, and from (76) we have

$$\liminf_{n\to\infty} -\frac{1}{2n}\log(D_j^{(n)}) \ge \log(\beta_j).$$

Hence, the MSE exponents $E_j = \log(\beta_j)$ for $j = 1, \ldots, N$ are achievable.

Appendix E
Proof of Lemma 7

Let $A$ and $B$ be of the form (31) with symmetric parameters $\beta_j = \beta$ and $\omega_j = e^{2\pi\sqrt{-1}\,(j-1)/N}$, $j = 1, \ldots, N$. We show that the unique positive-definite solution $G \succ 0$ of the following discrete algebraic Riccati equation (DARE)

$$G = A'GA - A'GB(B'GB + 1)^{-1}B'GA \tag{77}$$

is circulant with real eigenvalues satisfying $\lambda_i = \frac{1}{\beta^2}\lambda_{i-1}$ for $i = 2, \ldots, N$, and the largest eigenvalue $\lambda_1$ satisfies

$$1 + N\lambda_1 = \beta^{2N}$$

$$1 + \lambda_1\left(N - \frac{\lambda_1}{G_{11}}\right) = \beta^{2(N-1)}.$$

Note that any circulant matrix can be written as $Q\Lambda Q'$, where $Q$ is the $N$-point DFT matrix with

$$Q_{jk} = \frac{1}{\sqrt{N}}\, e^{-2\pi\sqrt{-1}\,(j-1)(k-1)/N} \tag{78}$$

and $\Lambda = \mathrm{diag}([\lambda_1, \ldots, \lambda_N])$ is the matrix with eigenvalues on its diagonal. We show that the circulant matrix $G = Q\Lambda Q'$ with positive $\lambda_j > 0$, such that $\lambda_j = \lambda_{j-1}/\beta^2$ for $j \ge 2$, satisfies the Riccati equation (77). Plugging $Q\Lambda Q'$ into (77) and rearranging we get

$$\Lambda = (Q'AQ)\Lambda(Q'AQ)' - \big((Q'AQ)\Lambda(Q'B)\big)\,(1 + B'Q\Lambda Q'B)^{-1}\,\big((Q'AQ)\Lambda(Q'B)\big)'.$$

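For this symmetric choice of $A$ we have

$$Q'AQ = \beta \begin{pmatrix} 0 & 1 & & \\ & \ddots & \ddots & \\ & & 0 & 1 \\ 1 & & & 0 \end{pmatrix}, \qquad Q'B = \begin{pmatrix} \sqrt{N} \\ 0 \\ \vdots \\ 0 \end{pmatrix}.$$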
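Hence,

$$(Q'AQ)\Lambda(Q'AQ)' = \beta^2 \begin{pmatrix} \lambda_2 & & & \\ & \lambda_3 & & \\ & & \ddots & \\ & & & \lambda_1 \end{pmatrix}, \qquad (Q'AQ)\Lambda(Q'B) = \begin{pmatrix} 0 \\ \vdots \\ 0 \\ \beta\lambda_1\sqrt{N} \end{pmatrix}$$

and the Riccati equation is transformed into $N$ diagonal equations. The first $N - 1$ equations are

$$\lambda_j = \beta^2 \lambda_{j+1}, \quad j = 1, \ldots, N - 1, \tag{79}$$

and the $N$-th equation is

$$\lambda_N = \frac{\beta^2 \lambda_1}{1 + N\lambda_1}. \tag{80}$$

From (79) we see that $\lambda_1$ is the largest eigenvalue and $\lambda_N = \beta^{-2(N-1)}\lambda_1$. Combining with (80) we get

$$1 + N\lambda_1 = \beta^{2N}. \tag{81}$$

Hence, $\lambda_1$ is real and so are $\lambda_j$, $j = 2, \ldots, N$.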

On the other hand, we consider the diagonal equations of the original DARE in (77). First, note that from the form of $Q$ in (78), $\lambda_1 = \sigma_1$, where

$$\sigma_j := \sum_{k=1}^{N} G_{jk}.$$

Moreover, since $G$ is circulant we know that $\sigma_j = \sigma_1$ and $G_{jj} = G_{11}$ for all $j = 1, \ldots, N$, and $(1 + B'GB) = 1 + N\lambda_1$. Hence, the diagonal equations of (77), i.e.,

$$G_{jj} = \beta^2 G_{jj} - \frac{\beta^2 |\sigma_j|^2}{1 + B'GB}, \quad j = 1, \ldots, N \tag{82}$$

are equivalent to

$$G_{11} = \beta^2 G_{11} - \frac{\beta^2 \lambda_1^2}{1 + N\lambda_1} \tag{83}$$

and by (81) we have

$$1 + \lambda_1\left(N - \frac{\lambda_1}{G_{11}}\right) = \beta^{2(N-1)}. \tag{84}$$

Combining (79), (81), and (84) completes the proof.
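The two structural facts used in this appendix, namely that $Q'AQ$ is $\beta$ times a cyclic shift and that $Q'B$ has a single nonzero entry $\sqrt{N}$, can be verified numerically. The sketch below assumes the unitary DFT convention of (78) with zero-based indices and $\omega_j$ equal to the $N$-th roots of unity; the function names are hypothetical.

```python
import cmath
import math


def dft_matrix(n):
    """Unitary N-point DFT matrix with the sign convention of (78), zero-based indices."""
    return [[cmath.exp(-2j * cmath.pi * j * k / n) / math.sqrt(n) for k in range(n)]
            for j in range(n)]


def transform_diagonal(q, diag):
    """Return Q' diag(a) Q, where ' denotes the conjugate transpose."""
    n = len(diag)
    return [[sum(q[j][k].conjugate() * diag[j] * q[j][l] for j in range(n))
             for l in range(n)] for k in range(n)]


def transform_ones(q):
    """Return Q' B for the all-ones vector B."""
    n = len(q)
    return [sum(q[j][k].conjugate() for j in range(n)) for k in range(n)]


if __name__ == "__main__":
    n, beta = 4, 1.3
    q = dft_matrix(n)
    a = [beta * cmath.exp(2j * cmath.pi * j / n) for j in range(n)]
    for row in transform_diagonal(q, a):
        print([round(abs(entry), 6) for entry in row])
    print([round(abs(entry), 6) for entry in transform_ones(q)])
```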